Universal Knowledge Repository

ABSTRACT

A system comprising: a processor; a data bus coupled to the processor; and a non-transitory, computer-readable storage medium embodying computer program code, the non-transitory, computer-readable storage medium being coupled to the data bus. The computer program code interacting with a plurality of computer operations and comprising instructions executable by the processor and configured for: receiving streams of data from a plurality of data sources; processing the streams of data from the plurality of data sources, the processing the streams of data from the plurality of data sources performing data enriching to provide data enriched data streams; and, storing the data streams and the data enriched data streams within the universal knowledge repository as a collection of knowledge elements.

CROSS-REFERENCE TO RELATED APPLICATION

This application claims the benefit under 35 U.S.C. §119(e) of U.S.Provisional Application No. 62/009,626, filed Jun. 9, 2014, entitled“Cognitive Information Processing System Environment.” U.S. ProvisionalApplication No. 62/009,626 includes exemplary systems and methods and isincorporated by reference in its entirety.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates in general to the field of computers andsimilar technologies, and in particular to software utilized in thisfield. Still more particularly, it relates to a method, system andcomputer-usable medium for generating and using a universal knowledgerepository.

2. Description of the Related Art

In general, “big data” refers to a collection of datasets so large andcomplex that they become difficult to process using typical databasemanagement tools and traditional data processing approaches. Thesedatasets can originate from a wide variety of sources, includingcomputer systems, mobile devices, credit card transactions, televisionbroadcasts, and medical equipment, as well as infrastructures associatedwith cities, sensor-equipped buildings and factories, and transportationsystems. Challenges commonly associated with big data, which may be acombination of structured, unstructured, and semi-structured data,include its capture, curation, storage, search, sharing, analysis andvisualization. In combination, these challenges make it difficult toefficiently process large quantities of data within tolerable timeintervals.

Nonetheless, big data analytics hold the promise of extracting insightsby uncovering difficult-to-discover patterns and connections, as well asproviding assistance in making complex decisions by analyzing differentand potentially conflicting options. As such, individuals andorganizations alike can be provided new opportunities to innovate,compete, and capture value.

One aspect of big data is “dark data,” which generally refers to datathat is either not collected, neglected, or underutilized. Examples ofdata that is not currently being collected includes location data priorto the emergence of companies such as Foursquare or social data prior tothe advent companies such as Facebook. An example of data that is beingcollected, but is difficult to access at the right time and place,includes data associated with the side effects of certain spider biteswhile on a camping trip. As another example, data that is collected andavailable, but has not yet been productized of fully utilized, mayinclude disease insights from population-wide healthcare records andsocial media feeds. As a result, a case can be made that dark data mayin fact be of higher value than big data in general, especially as itcan likely provide actionable insights when it is combined withreadily-available data.

SUMMARY OF THE INVENTION

A method, system and computer-usable medium are disclosed for cognitiveinference and learning operations.

In one embodiment, the invention relates to a system comprising: aprocessor; a data bus coupled to the processor; and a non-transitory,computer-readable storage medium embodying computer program code, thenon-transitory, computer-readable storage medium being coupled to thedata bus. The computer program code interacting with a plurality ofcomputer operations and comprising instructions executable by theprocessor and configured for: receiving streams of data from a pluralityof data sources; processing the streams of data from the plurality ofdata sources, the processing the streams of data from the plurality ofdata sources performing data enriching to provide data enriched datastreams; and, storing the data streams and the data enriched datastreams within the universal knowledge repository as a collection ofknowledge elements.

In another embodiment, the invention relates to a non-transitory,computer-readable storage medium embodying computer program code, thecomputer program code comprising computer executable instructionsconfigured for: receiving streams of data from a plurality of datasources; processing the streams of data from the plurality of datasources, the processing the streams of data from the plurality of datasources performing data enriching to provide data enriched data streams;and, storing the data streams and the data enriched data streams withinthe universal knowledge repository as a collection of knowledgeelements.

BRIEF DESCRIPTION OF THE DRAWINGS

The present invention may be better understood, and its numerousobjects, features and advantages made apparent to those skilled in theart by referencing the accompanying drawings. The use of the samereference number throughout the several figures designates a like orsimilar element.

FIG. 1 depicts an exemplary client computer in which the presentinvention may be implemented;

FIG. 2 is a simplified block diagram of a cognitive inference andlearning system (CILS);

FIG. 3 is a simplified block diagram of a CILS reference modelimplemented in accordance with an embodiment of the invention;

FIGS. 4 a through 4 c depict additional components of the CILS referencemodel shown in FIG. 3;

FIG. 5 is a simplified process diagram of CILS operations;

FIG. 6 depicts the lifecycle of CILS agents implemented to perform CILSoperations;

FIG. 7 is a simplified block diagram of a plurality of cognitiveplatforms implemented in a hybrid cloud environment;

FIGS. 8 a and 8 b show a simplified process flow diagram of compositecognitive insight generation and feedback operations; and

FIG. 9 is a generalized flowchart of the performance of compositecognitive insight generation and feedback operations.

DETAILED DESCRIPTION

A method, system and computer-usable medium are disclosed for cognitiveinference and learning operations. The present invention may be asystem, a method, and/or a computer program product. The computerprogram product may include a computer readable storage medium (ormedia) having computer readable program instructions thereon for causinga processor to carry out aspects of the present invention.

The computer readable storage medium can be a tangible device that canretain and store instructions for use by an instruction executiondevice. The computer readable storage medium may be, for example, but isnot limited to, an electronic storage device, a magnetic storage device,an optical storage device, an electromagnetic storage device, asemiconductor storage device, or any suitable combination of theforegoing. A non-exhaustive list of more specific examples of thecomputer readable storage medium includes the following: a portablecomputer diskette, a hard disk, a random access memory (RAM), aread-only memory (ROM), an erasable programmable read-only memory (EPROMor Flash memory), a static random access memory (SRAM), a portablecompact disc read-only memory (CD-ROM), a digital versatile disk (DVD),a memory stick, a floppy disk, a mechanically encoded device such aspunch-cards or raised structures in a groove having instructionsrecorded thereon, and any suitable combination of the foregoing. Acomputer readable storage medium, as used herein, is not to be construedas being transitory signals per se, such as radio waves or other freelypropagating electromagnetic waves, electromagnetic waves propagatingthrough a waveguide or other transmission media (e.g., light pulsespassing through a fiber-optic cable), or electrical signals transmittedthrough a wire.

Computer readable program instructions described herein can bedownloaded to respective computing/processing devices from a computerreadable storage medium or to an external computer or external storagedevice via a network, for example, the Internet, a local area network, awide area network and/or a wireless network. The network may comprisecopper transmission cables, optical transmission fibers, wirelesstransmission, routers, firewalls, switches, gateway computers and/oredge servers. A network adapter card or network interface in eachcomputing/processing device receives computer readable programinstructions from the network and forwards the computer readable programinstructions for storage in a computer readable storage medium withinthe respective computing/processing device.

Computer readable program instructions for carrying out operations ofthe present invention may be assembler instructions,instruction-set-architecture (ISA) instructions, machine instructions,machine dependent instructions, microcode, firmware instructions,state-setting data, or either source code or object code written in anycombination of one or more programming languages, including an objectoriented programming language such as Smalltalk, C++ or the like, andconventional procedural programming languages, such as the “C”programming language or similar programming languages. The computerreadable program instructions may execute entirely on the user'scomputer, partly on the user's computer, as a stand-alone softwarepackage, partly on the user's computer and partly on a remote computeror entirely on the remote computer or server. In the latter scenario,the remote computer may be connected to the user's computer through anytype of network, including a local area network (LAN) or a wide areanetwork (WAN), or the connection may be made to an external computer(for example, through the Internet using an Internet Service Provider).In some embodiments, electronic circuitry including, for example,programmable logic circuitry, field-programmable gate arrays (FPGA), orprogrammable logic arrays (PLA) may execute the computer readableprogram instructions by utilizing state information of the computerreadable program instructions to personalize the electronic circuitry,in order to perform aspects of the present invention.

Aspects of the present invention are described herein with reference toflowchart illustrations and/or block diagrams of methods, apparatus(systems), and computer program products according to embodiments of theinvention. It will be understood that each block of the flowchartillustrations and/or block diagrams, and combinations of blocks in theflowchart illustrations and/or block diagrams, can be implemented bycomputer readable program instructions.

These computer readable program instructions may be provided to aprocessor of a general purpose computer, special purpose computer, orother programmable data processing apparatus to produce a machine, suchthat the instructions, which execute via the processor of the computeror other programmable data processing apparatus, create means forimplementing the functions/acts specified in the flowchart and/or blockdiagram block or blocks. These computer readable program instructionsmay also be stored in a computer readable storage medium that can directa computer, a programmable data processing apparatus, and/or otherdevices to function in a particular manner, such that the computerreadable storage medium having instructions stored therein comprises anarticle of manufacture including instructions which implement aspects ofthe function/act specified in the flowchart and/or block diagram blockor blocks.

The computer readable program instructions may also be loaded onto acomputer, other programmable data processing apparatus, or other deviceto cause a series of operational steps to be performed on the computer,other programmable apparatus or other device to produce a computerimplemented process, such that the instructions which execute on thecomputer, other programmable apparatus, or other device implement thefunctions/acts specified in the flowchart and/or block diagram block orblocks.

The flowchart and block diagrams in the Figures illustrate thearchitecture, functionality, and operation of possible implementationsof systems, methods, and computer program products according to variousembodiments of the present invention. In this regard, each block in theflowchart or block diagrams may represent a module, segment, or portionof instructions, which comprises one or more executable instructions forimplementing the specified logical function(s). In some alternativeimplementations, the functions noted in the block may occur out of theorder noted in the figures. For example, two blocks shown in successionmay, in fact, be executed substantially concurrently, or the blocks maysometimes be executed in the reverse order, depending upon thefunctionality involved. It will also be noted that each block of theblock diagrams and/or flowchart illustration, and combinations of blocksin the block diagrams and/or flowchart illustration, can be implementedby special purpose hardware-based systems that perform the specifiedfunctions or acts or carry out combinations of special purpose hardwareand computer instructions.

FIG. 1 is a generalized illustration of an information processing system100 that can be used to implement the system and method of the presentinvention. The information processing system 100 includes a processor(e.g., central processor unit or “CPU”) 102, input/output (I/O) devices104, such as a display, a keyboard, a mouse, and associated controllers,a hard drive or disk storage 106, and various other subsystems 108. Invarious embodiments, the information processing system 100 also includesnetwork port 110 operable to connect to a network 140, which is likewiseaccessible by a service provider server 142. The information processingsystem 100 likewise includes system memory 112, which is interconnectedto the foregoing via one or more buses 114. System memory 112 furthercomprises operating system (OS) 116 and in various embodiments may alsocomprise cognitive inference and learning system (CILS) 118. In theseand other embodiments, the CILS 118 may likewise comprise inventionmodules 120. In one embodiment, the information processing system 100 isable to download the CILS 118 from the service provider server 142. Inanother embodiment, the CILS 118 is provided as a service from theservice provider server 142.

In various embodiments, the CILS 118 is implemented to perform variouscognitive computing operations described in greater detail herein. Asused herein, cognitive computing broadly refers to a class of computinginvolving self-learning systems that use techniques such as spatialnavigation, machine vision, and pattern recognition to increasinglymimic the way the human brain works. To be more specific, earlierapproaches to computing typically solved problems by executing a set ofinstructions codified within software. In contrast, cognitive computingapproaches are data-driven, sense-making, insight-extracting,problem-solving systems that have more in common with the structure ofthe human brain than with the architecture of contemporary,instruction-driven computers.

To further differentiate these distinctions, traditional computers mustfirst be programmed by humans to perform specific tasks, while cognitivesystems learn from their interactions with data and humans alike, and ina sense, program themselves to perform new tasks. To summarize thedifference between the two, traditional computers are designed tocalculate rapidly. Cognitive systems are designed to quickly drawinferences from data and gain new knowledge.

Cognitive systems achieve these abilities by combining various aspectsof artificial intelligence, natural language processing, dynamiclearning, and hypothesis generation to render vast quantities ofintelligible data to assist humans in making better decisions. As such,cognitive systems can be characterized as having the ability to interactnaturally with people to extend what either humans, or machines, coulddo on their own. Furthermore, they are typically able to process naturallanguage, multi-structured data, and experience much in the same way ashumans. Moreover, they are also typically able to learn a knowledgedomain based upon the best available data and get better, and moreimmersive, over time.

It will be appreciated that more data is currently being produced everyday than was recently produced by human beings from the beginning ofrecorded time. Deep within this ever-growing mass of data is a class ofdata known as “dark data,” which includes neglected information, ambientsignals, and insights that can assist organizations and individuals inaugmenting their intelligence and deliver actionable insights throughthe implementation of cognitive applications. As used herein, cognitiveapplications, or “cognitive apps,” broadly refer to cloud-based, bigdata interpretive applications that learn from user engagement and datainteractions. Such cognitive applications extract patterns and insightsfrom dark data sources that are currently almost completely opaque.Examples of such dark data include disease insights from population-widehealthcare records and social media feeds, or from new sources ofinformation, such as sensors monitoring pollution in delicate marineenvironments.

Over time, it is anticipated that cognitive applications willfundamentally change the ways in which many organizations operate asthey invert current issues associated with data volume and variety toenable a smart, interactive data supply chain. Ultimately, cognitiveapplications hold the promise of receiving a user query and immediatelyproviding a data-driven answer from a masked data supply chain inresponse. As they evolve, it is likewise anticipated that cognitiveapplications may enable a new class of “sixth sense” applications thatintelligently detect and learn from relevant data and events to offerinsights, predictions and advice rather than wait for commands. Just asweb and mobile applications changed the way people access data,cognitive applications may change the way people listen to, and becomeempowered by, multi-structured data such as emails, social media feeds,doctors notes, transaction records, and call logs.

However, the evolution of such cognitive applications has associatedchallenges, such as how to detect events, ideas, images, and othercontent that may be of interest. For example, assuming that the role andpreferences of a given user are known, how is the most relevantinformation discovered, prioritized, and summarized from large streamsof multi-structured data such as news feeds, blogs, social media,structured data, and various knowledge bases? To further the example,what can a healthcare executive be told about their competitor's marketshare? Other challenges include the creation of acontextually-appropriate visual summary of responses to questions orqueries.

FIG. 2 is a simplified block diagram of a cognitive inference andlearning system (CILS) implemented in accordance with an embodiment ofthe invention. In various embodiments, the CILS 118 is implemented toincorporate a variety of processes, including semantic analysis 202,goal optimization 204, collaborative filtering 206, common sensereasoning 208, natural language processing 210, summarization 212,temporal/spatial reasoning 214, and entity resolution 216 to generatecognitive insights.

As used herein, semantic analysis 202 broadly refers to performingvarious analysis operations to achieve a semantic level of understandingabout language by relating syntactic structures. In various embodiments,various syntactic structures are related from the levels of phrases,clauses, sentences and paragraphs, to the level of the body of contentas a whole and to its language-independent meaning. In certainembodiments, the semantic analysis 202 process includes processing atarget sentence to parse it into its individual parts of speech, tagsentence elements that are related to predetermined items of interest,identify dependencies between individual words, and perform co-referenceresolution. For example, if a sentence states that the author reallylikes the hamburgers served by a particular restaurant, then the name ofthe “particular restaurant” is co-referenced to “hamburgers.”

As likewise used herein, goal optimization 204 broadly refers toperforming multi-criteria decision making operations to achieve a givengoal or target objective. In various embodiments, one or more goaloptimization 204 processes are implemented by the CILS 118 to definepredetermined goals, which in turn contribute to the generation of acognitive insight. For example, goals for planning a vacation trip mayinclude low cost (e.g., transportation and accommodations), location(e.g., by the beach), and speed (e.g., short travel time). In thisexample, it will be appreciated that certain goals may be in conflictwith another. As a result, a cognitive insight provided by the CILS 118to a traveler may indicate that hotel accommodations by a beach may costmore than they care to spend.

Collaborative filtering 206, as used herein, broadly refers to theprocess of filtering for information or patterns through thecollaborative involvement of multiple agents, viewpoints, data sources,and so forth. The application of such collaborative filtering 206processes typically involves very large and different kinds of datasets, including sensing and monitoring data, financial data, and userdata of various kinds Collaborative filtering 206 may also refer to theprocess of making automatic predictions associated with predeterminedinterests of a user by collecting preferences or other information frommany users. For example, if person ‘A’ has the same opinion as a person‘B’ for a given issue ‘x’, then an assertion can be made that person ‘A’is more likely to have the same opinion as person ‘B’ opinion on adifferent issue ‘y’ than to have the same opinion on issue ‘y’ as arandomly chosen person. In various embodiments, the collaborativefiltering 206 process is implemented with various recommendation enginesfamiliar to those of skill in the art to make recommendations.

As used herein, common sense reasoning 208 broadly refers to simulatingthe human ability to make deductions from common facts they inherentlyknow. Such deductions may be made from inherent knowledge about thephysical properties, purpose, intentions and possible behavior ofordinary things, such as people, animals, objects, devices, and so on.In various embodiments, common sense reasoning 208 processes areimplemented to assist the CILS 118 in understanding and disambiguatingwords within a predetermined context. In certain embodiments, the commonsense reasoning 208 processes are implemented to allow the CILS 118 togenerate text or phrases related to a target word or phrase to performdeeper searches for the same terms. It will be appreciated that if thecontext of a word is better understood, then a common senseunderstanding of the word can then be used to assist in finding betteror more accurate information. In certain embodiments, this better ormore accurate understanding of the context of a word, and its relatedinformation, allows the CILS 118 to make more accurate deductions, whichare in turn used to generate cognitive insights.

As likewise used herein, natural language processing (NLP) 210 broadlyrefers to interactions with a system, such as the CILS 118, through theuse of human, or natural, languages. In various embodiments, various NLP210 processes are implemented by the CILS 118 to achieve naturallanguage understanding, which enables it to not only derive meaning fromhuman or natural language input, but to also generate natural languageoutput.

Summarization 212, as used herein, broadly refers to processing a set ofinformation, organizing and ranking it, and then generating acorresponding summary. As an example, a news article may be processed toidentify its primary topic and associated observations, which are thenextracted, ranked, and then presented to the user. As another example,page ranking operations may be performed on the same news article toidentify individual sentences, rank them, order them, and determinewhich of the sentences are most impactful in describing the article andits content. As yet another example, a structured data record, such as apatient's electronic medical record (EMR), may be processed using thesummarization 212 process to generate sentences and phrases thatdescribes the content of the EMR. In various embodiments, varioussummarization 212 processes are implemented by the CILS 118 to generatesummarizations of content streams, which are in turn used to generatecognitive insights.

As used herein, temporal/spatial reasoning 214 broadly refers toreasoning based upon qualitative abstractions of temporal and spatialaspects of common sense knowledge, described in greater detail herein.For example, it is not uncommon for a predetermined set of data tochange over time. Likewise, other attributes, such as its associatedmetadata, may likewise change over time. As a result, these changes mayaffect the context of the data. To further the example, the context ofasking someone what they believe they should be doing at 3:00 in theafternoon during the workday while they are at work may be quitedifferent that asking the same user the same question at 3:00 on aSunday afternoon when they are at home. In various embodiments, varioustemporal/spatial reasoning 214 processes are implemented by the CILS 118to determine the context of queries, and associated data, which are inturn used to generate cognitive insights.

As likewise used herein, entity resolution 216 broadly refers to theprocess of finding elements in a set of data that refer to the sameentity across different data sources (e.g., structured, non-structured,streams, devices, etc.), where the target entity does not share a commonidentifier. In various embodiments, the entity resolution 216 process isimplemented by the CILS 118 to identify significant nouns, adjectives,phrases or sentence elements that represent various predeterminedentities within one or more domains. From the foregoing, it will beappreciated that the implementation of one or more of the semanticanalysis 202, goal optimization 204, collaborative filtering 206, commonsense reasoning 208, natural language processing 210, summarization 212,temporal/spatial reasoning 214, and entity resolution 216 processes bythe CILS 118 can facilitate the generation of a semantic, cognitivemodel.

In various embodiments, the CILS 118 receives ambient signals 220,curated data 222, and learned knowledge, which is then processed by theCILS 118 to generate one or more cognitive graphs 226. In turn, the oneor more cognitive graphs 226 are further used by the CILS 118 togenerate cognitive insight streams, which are then delivered to one ormore destinations 230, as described in greater detail herein.

As used herein, ambient signals 220 broadly refer to input signals, orother data streams, that may contain data providing additional insightor context to the curated data 222 and learned knowledge 224 received bythe CILS 118. For example, ambient signals may allow the CILS 118 tounderstand that a user is currently using their mobile device, atlocation ‘x’, at time ‘y’, doing activity ‘z’. To further the example,there is a difference between the user using their mobile device whilethey are on an airplane versus using their mobile device after landingat an airport and walking between one terminal and another. To extendthe example even further, ambient signals may add additional context,such as the user is in the middle of a three leg trip and has two hoursbefore their next flight. Further, they may be in terminal A1, but theirnext flight is out of C1, it is lunchtime, and they want to know thebest place to eat. Given the available time the user has, their currentlocation, restaurants that are proximate to their predicted route, andother factors such as food preferences, the CILS 118 can perform variouscognitive operations and provide a recommendation for where the user caneat.

In various embodiments, the curated data 222 may include structured,unstructured, social, public, private, streaming, device or other typesof data described in greater detail herein. In certain embodiments, thelearned knowledge 224 is based upon past observations and feedback fromthe presentation of prior cognitive insight streams and recommendations.In various embodiments, the learned knowledge 224 is provided via afeedback look that provides the learned knowledge 224 in the form of alearning stream of data.

As likewise used herein, a cognitive graph 226 refers to arepresentation of expert knowledge, associated with individuals andgroups over a period of time, to depict relationships between people,places, and things using words, ideas, audio and images. As such, it isa machine-readable formalism for knowledge representation that providesa common framework allowing data and knowledge to be shared and reusedacross user, application, organization, and community boundaries.

In various embodiments, the information contained in, and referenced by,a cognitive graph 226 is derived from many sources (e.g., public,private, social, device), such as curated data 222. In certain of theseembodiments, the cognitive graph 226 assists in the identification andorganization of information associated with how people, places andthings are related to one other. In various embodiments, the cognitivegraph 226 enables automated agents, described in greater detail herein,to access the Web more intelligently, enumerate inferences throughutilization of curated, structured data 222, and provide answers toquestions by serving as a computational knowledge engine.

In certain embodiments, the cognitive graph 226 not only elicits andmaps expert knowledge by deriving associations from data, it alsorenders higher level insights and accounts for knowledge creationthrough collaborative knowledge modeling. In various embodiments, thecognitive graph 226 is a machine-readable, declarative memory systemthat stores and learns both episodic memory (e.g., specific personalexperiences associated with an individual or entity), and semanticmemory, which stores factual information (e.g., geo location of anairport or restaurant).

For example, the cognitive graph 226 may know that a given airport is aplace, and that there is a list of related places such as hotels,restaurants and departure gates. Furthermore, the cognitive graph 226may know that people such as business travelers, families and collegestudents use the airport to board flights from various carriers, eat atvarious restaurants, or shop at certain retail stores. The cognitivegraph 226 may also have knowledge about the key attributes from variousretail rating sites that travelers have used to describe the food andtheir experience at various venues in the airport over the past sixmonths.

In certain embodiments, the cognitive insight stream 228 isbidirectional, and supports flows of information both too and fromdestinations 230. In these embodiments, the first flow is generated inresponse to receiving a query, and subsequently delivered to one or moredestinations 230. The second flow is generated in response to detectinginformation about a user of one or more of the destinations 230. Suchuse results in the provision of information to the CILS 118. Inresponse, the CILS 118 processes that information, in the context ofwhat it knows about the user, and provides additional information to theuser, such as a recommendation. In various embodiments, the cognitiveinsight stream 228 is configured to be provided in a “push” streamconfiguration familiar to those of skill in the art. In certainembodiments, the cognitive insight stream 228 is implemented to usenatural language approaches familiar to skilled practitioners of the artto support interactions with a user.

In various embodiments, the cognitive insight stream 228 may include astream of visualized insights. As used herein, visualized insightsbroadly refers to cognitive insights that are presented in a visualmanner, such as a map, an infographic, images, and so forth. In certainembodiments, these visualized insights may include various cognitiveinsights, such as “What happened?”, “What do I know about it?”, “What islikely to happen next?”, or “What should I do about it?” In theseembodiments, the cognitive insight stream is generated by variouscognitive agents, which are applied to various sources, datasets, andcognitive graphs. As used herein, a cognitive agent broadly refers to acomputer program that performs a task with minimum specific directionsfrom users and learns from each interaction with data and human users.

In various embodiments, the CILS 118 delivers Cognition as a Service(CaaS). As such, it provides a cloud-based development and executionplatform that allow various cognitive applications and services tofunction more intelligently and intuitively. In certain embodiments,cognitive applications powered by the CILS 118 are able to think andinteract with users as intelligent virtual assistants. As a result,users are able to interact with such cognitive applications by askingthem questions and giving them commands. In response, these cognitiveapplications will be able to assist the user in completing tasks andmanaging their work more efficiently.

In these and other embodiments, the CILS 118 can operate as an analyticsplatform to process big data, and dark data as well, to provide dataanalytics through a public, private or hybrid cloud environment. As usedherein, cloud analytics broadly refers to a service model wherein datasources, data models, processing applications, computing power, analyticmodels, and sharing or storage of results are implemented within a cloudenvironment to perform one or more aspects of analytics.

In various embodiments, users submit queries and computation requests ina natural language format to the CILS 118. In response, they areprovided with a ranked list of relevant answers and aggregatedinformation with useful links and pertinent visualizations through agraphical representation. In these embodiments, the cognitive graph 226generates semantic and temporal maps to reflect the organization ofunstructured data and to facilitate meaningful learning from potentiallymillions of lines of text, much in the same way as arbitrary syllablesstrung together create meaning through the concept of language.

FIG. 3 is a simplified block diagram of a cognitive inference andlearning system (CILS) reference model implemented in accordance with anembodiment of the invention. In this embodiment, the CILS referencemodel is associated with the CILS 118 shown in FIG. 2. As shown in FIG.3, the CILS 118 includes client applications 302, applicationaccelerators 306, a cognitive platform 310, and cloud infrastructure340. In various embodiments, the client applications 302 includecognitive applications 304, which are implemented to understand andadapt to the user, not the other way around, by natively accepting andunderstanding human forms of communication, such as natural languagetext, audio, images, video, and so forth.

In these and other embodiments, the cognitive applications 304 possesssituational and temporal awareness based upon ambient signals from usersand data, which facilitates understanding the user's intent, content,context and meaning to drive goal-driven dialogs and outcomes. Further,they are designed to gain knowledge over time from a wide variety ofstructured, non-structured, and device data sources, continuouslyinterpreting and autonomously reprogramming themselves to betterunderstand a given domain. As such, they are well-suited to supporthuman decision making, by proactively providing trusted advice, offersand recommendations while respecting user privacy and permissions.

In various embodiments, the application accelerators 306 include acognitive application framework 308. In certain embodiments, theapplication accelerators 306 and the cognitive application framework 308support various plug-ins and components that facilitate the creation ofclient applications 302 and cognitive applications 304. In variousembodiments, the application accelerators 306 include widgets, userinterface (UI) components, reports, charts, and back-end integrationcomponents familiar to those of skill in the art.

As likewise shown in FIG. 3, the cognitive platform 310 includes amanagement console 312, a development environment 314, applicationprogram interfaces (APIs) 316, sourcing agents 318, a cognitive engine320, destination agents 336, and platform data 338, all of which aredescribed in greater detail herein. In various embodiments, themanagement console 312 is implemented to manage accounts and projects,along with user-specific metadata that is used to drive processes andoperations within the cognitive platform 310 for a predeterminedproject.

In certain embodiments, the development environment 314 is implementedto create custom extensions to the CILS 118 shown in FIG. 2. In variousembodiments, the development environment 314 is implemented for thedevelopment of a custom application, which may subsequently be deployedin a public, private or hybrid cloud environment. In certainembodiments, the development environment 314 is implemented for thedevelopment of a custom sourcing agent, a custom bridging agent, acustom destination agent, or various analytics applications orextensions.

In various embodiments, the APIs 316 are implemented to build and managepredetermined cognitive applications 304, described in greater detailherein, which are then executed on the cognitive platform 310 togenerate cognitive insights. Likewise, the sourcing agents 318 areimplemented in various embodiments to source a variety of multi-site,multi-structured source streams of data described in greater detailherein. In various embodiments, the cognitive engine 320 includes adataset engine 322, a graph query engine 326, an insight/learning engine330, and foundation components 334. In certain embodiments, the datasetengine 322 is implemented to establish and maintain a dynamic dataingestion and enrichment pipeline. In these and other embodiments, thedataset engine 322 may be implemented to orchestrate one or moresourcing agents 318 to source data. Once the data is sourced, the dataset engine 322 performs data enriching and other data processingoperations, described in greater detail herein, and generates one ormore sub-graphs that are subsequently incorporated into a targetcognitive graph.

In various embodiments, the graph query engine 326 is implemented toreceive and process queries such that they can be bridged into acognitive graph, as described in greater detail herein, through the useof a bridging agent. In certain embodiments, the graph query engine 326performs various natural language processing (NLP), familiar to skilledpractitioners of the art, to process the queries. In variousembodiments, the insight/learning engine 330 is implemented toencapsulate a predetermined algorithm, which is then applied to acognitive graph to generate a result, such as a cognitive insight or arecommendation. In certain embodiments, one or more such algorithms maycontribute to answering a specific question and provide additionalcognitive insights or recommendations. In various embodiments, two ormore of the dataset engine 322, the graph query engine 326, and theinsight/learning engine 330 may be implemented to operatecollaboratively to generate a cognitive insight or recommendation. Incertain embodiments, one or more of the dataset engine 322, the graphquery engine 326, and the insight/learning engine 330 may operateautonomously to generate a cognitive insight or recommendation.

The foundation components 334 shown in FIG. 3 include various reusablecomponents, familiar to those of skill in the art, which are used invarious embodiments to enable the dataset engine 322, the graph queryengine 326, and the insight/learning engine 330 to perform theirrespective operations and processes. Examples of such foundationcomponents 334 include natural language processing (NLP) components andcore algorithms, such as cognitive algorithms.

In various embodiments, the platform data 338 includes various datarepositories, described in greater detail herein, that are accessed bythe cognitive platform 310 to generate cognitive insights. In variousembodiments, the destination agents 336 are implemented to publishcognitive insights to a consumer of cognitive insight data. Examples ofsuch consumers of cognitive insight data include target databases,business intelligence applications, and mobile applications. It will beappreciated that many such examples of cognitive insight data consumersare possible and the foregoing is not intended to limit the spirit,scope or intent of the invention. In various embodiments, as describedin greater detail herein, the cloud infrastructure 340 includescognitive cloud management 342 components and cloud analyticsinfrastructure components 344.

FIGS. 4 a through 4 c depict additional cognitive inference and learningsystem (CILS) components implemented in accordance with an embodiment ofthe CILS reference model shown in FIG. 3. In this embodiment, the CILSreference model includes client applications 302, applicationaccelerators 306, a cognitive platform 310, and cloud infrastructure340. As shown in FIG. 4 a, the client applications 302 include cognitiveapplications 304. In various embodiments, the cognitive applications 304are implemented natively accept and understand human forms ofcommunication, such as natural language text, audio, images, video, andso forth. In certain embodiments, the cognitive applications 304 mayinclude healthcare 402, business performance 403, travel 404, andvarious other 405 applications familiar to skilled practitioners of theart. As such, the foregoing is only provided as examples of suchcognitive applications 304 and is not intended to limit the intent,spirit of scope of the invention.

In various embodiments, the application accelerators 306 include acognitive application framework 308. In certain embodiments, theapplication accelerators 308 and the cognitive application framework 308support various plug-ins and components that facilitate the creation ofclient applications 302 and cognitive applications 304. In variousembodiments, the application accelerators 306 include widgets, userinterface (UI) components, reports, charts, and back-end integrationcomponents familiar to those of skill in the art. It will be appreciatedthat many such application accelerators 306 are possible and theirprovided functionality, selection, provision and support are a matter ofdesign choice. As such, the application accelerators 306 described ingreater detail herein are not intended to limit the spirit, scope orintent of the invention.

As shown in FIGS. 4 a and 4 b, the cognitive platform 310 includes amanagement console 312, a development environment 314, applicationprogram interfaces (APIs) 316, sourcing agents 318, a cognitive engine320, destination agents 336, platform data 338, and a crawl framework452. In various embodiments, the management console 312 is implementedto manage accounts and projects, along with management metadata 461 thatis used to drive processes and operations within the cognitive platform310 for a predetermined project.

In various embodiments, the management console 312 is implemented to runvarious services on the cognitive platform 310. In certain embodiments,the management console 312 is implemented to manage the configuration ofthe cognitive platform 310. In certain embodiments, the managementconsole 312 is implemented to establish the development environment 314.In various embodiments, the management console 312 may be implemented tomanage the development environment 314 once it is established. Skilledpractitioners of the art will realize that many such embodiments arepossible and the foregoing is not intended to limit the spirit, scope orintent of the invention.

In various embodiments, the development environment 314 is implementedto create custom extensions to the CILS 118 shown in FIG. 2. In theseand other embodiments, the development environment 314 is implemented tosupport various programming languages, such as Python, Java, R, andothers familiar to skilled practitioners of the art. In variousembodiments, the development environment 314 is implemented to allow oneor more of these various programming languages to create a variety ofanalytic models and applications. As an example, the developmentenvironment 314 may be implemented to support the R programminglanguage, which in turn can be used to create an analytic model that isthen hosted on the cognitive platform 310.

In certain embodiments, the development environment 314 is implementedfor the development of various custom applications or extensions relatedto the cognitive platform 310, which may subsequently be deployed in apublic, private or hybrid cloud environment. In various embodiments, thedevelopment environment 314 is implemented for the development ofvarious custom sourcing agents 318, custom enrichment agents 425, custombridging agents 429, custom insight agents 433, custom destinationagents 336, and custom learning agents 434, which are described ingreater detail herein.

In various embodiments, the APIs 316 are implemented to build and managepredetermined cognitive applications 304, described in greater detailherein, which are then executed on the cognitive platform 310 togenerate cognitive insights. In these embodiments, the APIs 316 mayinclude one or more of a project and dataset API 408, a cognitive searchAPI 409, a cognitive insight API 410, and other APIs. The selection ofthe individual APIs 316 implemented in various embodiments is a matterdesign choice and the foregoing is not intended to limit the spirit,scope or intent of the invention.

In various embodiments, the project and dataset API 408 is implementedwith the management console 312 to enable the management of a variety ofdata and metadata associated with various cognitive insight projects anduser accounts hosted or supported by the cognitive platform 310. In oneembodiment, the data and metadata managed by the project and dataset API408 are associated with billing information familiar to those of skillin the art. In one embodiment, the project and dataset API 408 is usedto access a data stream that is created, configured and orchestrated, asdescribed in greater detail herein, by the dataset engine 322.

In various embodiments, the cognitive search API 409 uses naturallanguage processes familiar to those of skill in the art to search atarget cognitive graph. Likewise, the cognitive insight API 410 isimplemented in various embodiments to configure the insight/learningengine 330 to provide access to predetermined outputs from one or morecognitive graph algorithms that are executing in the cognitive platform310. In certain embodiments, the cognitive insight API 410 isimplemented to subscribe to, or request, such predetermined outputs.

In various embodiments, the sourcing agents 318 may include a batchupload 414 agent, an API connectors 415 agent, a real-time streams 416agent, a Structured Query Language (SQL)/Not Only SQL (NoSQL) databases417 agent, a message engines 418 agent, and one or more custom sourcing420 agents. Skilled practitioners of the art will realize that othertypes of sourcing agents 318 may be used in various embodiments and theforegoing is not intended to limit the spirit, scope or intent of theinvention. In various embodiments, the sourcing agents 318 areimplemented to source a variety of multi-site, multi-structured sourcestreams of data described in greater detail herein. In certainembodiments, each of the sourcing agents 318 has a corresponding API.

In various embodiments, the batch uploading 414 agent is implemented forbatch uploading of data to the cognitive platform 310. In theseembodiments, the uploaded data may include a single data element, asingle data record or file, or a plurality of data records or files. Incertain embodiments, the data may be uploaded from more than one sourceand the uploaded data may be in a homogenous or heterogeneous form. Invarious embodiments, the API connectors 415 agent is implemented tomanage interactions with one or more predetermined APIs that areexternal to the cognitive platform 310. As an example, Associated Press®may have their own API for news stories, Expedia® for travelinformation, or the National Weather Service for weather information. Inthese examples, the API connectors 415 agent would be implemented todetermine how to respectively interact with each organization's API suchthat the cognitive platform 310 can receive information.

In various embodiments, the real-time streams 416 agent is implementedto receive various streams of data, such as social media streams (e.g.,Twitter feeds) or other data streams (e.g., device data streams). Inthese embodiments, the streams of data are received in near-real-time.In certain embodiments, the data streams include temporal attributes. Asan example, as data is added to a blog file, it is time-stamped tocreate temporal data. Other examples of a temporal data stream includeTwitter feeds, stock ticker streams, device location streams from adevice that is tracking location, medical devices tracking a patient'svital signs, and intelligent thermostats used to improve energyefficiency for homes.

In certain embodiments, the temporal attributes define a time window,which can be correlated to various elements of data contained in thestream. For example, as a given time window changes, associated data mayhave a corresponding change. In various embodiments, the temporalattributes do not define a time window. As an example, a social mediafeed may not have predetermined time windows, yet it is still temporal.As a result, the social media feed can be processed to determine whathappened in the last 24 hours, what happened in the last hour, whathappened in the last 15 minutes, and then determine related subjectmatter that is trending.

In various embodiments, the SQL/NoSQL databases 417 agent is implementedto interact with one or more target databases familiar to those of skillin the art. For example, the target database may include a SQL, NoSQL,delimited flat file, or other form of database. In various embodiments,the message engines 418 agent is implemented to provide data to thecognitive platform 310 from one or more message engines, such as amessage queue (MQ) system, a message bus, a message broker, anenterprise service bus (ESB), and so forth. Skilled practitioners of theart will realize that there are many such examples of message engineswith which the message engines 418 agent may interact and the foregoingis not intended to limit the spirit, scope or intent of the invention.

In various embodiments, the custom sourcing agents 420, which arepurpose-built, are developed through the use of the developmentenvironment 314, described in greater detail herein. Examples of customsourcing agents 420 include sourcing agents for various electronicmedical record (EMR) systems at various healthcare facilities. Such EMRsystems typically collect a variety of healthcare information, much ofit the same, yet it may be collected, stored and provided in differentways. In this example, the custom sourcing agents 420 allow thecognitive platform 310 to receive information from each disparatehealthcare source.

In various embodiments, the cognitive engine 320 includes a datasetengine 322, a graph engine 326, an insight/learning engine 330, learningagents 434, and foundation components 334. In these and otherembodiments, the dataset engine 322 is implemented as described ingreater detail to establish and maintain a dynamic data ingestion andenrichment pipeline. In various embodiments, the dataset engine 322 mayinclude a pipelines 422 component, an enrichment 423 component, astorage component 424, and one or more enrichment agents 425.

In various embodiments, the pipelines 422 component is implemented toingest various data provided by the sourcing agents 318. Once ingested,this data is converted by the pipelines 422 component into streams ofdata for processing. In certain embodiments, these managed streams areprovided to the enrichment 423 component, which performs data enrichmentoperations familiar to those of skill in the art. As an example, a datastream may be sourced from Associated Press® by a sourcing agent 318 andprovided to the dataset engine 322. The pipelines 422 component receivesthe data stream and routes it to the enrichment 423 component, whichthen enriches the data stream by performing sentiment analysis,geotagging, and entity detection operations to generate an enriched datastream. In certain embodiments, the enrichment operations includefiltering operations familiar to skilled practitioners of the art. Tofurther the preceding example, the Associated Press® data stream may befiltered by a predetermined geography attribute to generate an enricheddata stream.

The enriched data stream is then subsequently stored, as described ingreater detail herein, in a predetermined location. In variousembodiments, the enriched data stream is cached by the storage 424component to provide a local version of the enriched data stream. Incertain embodiments, the cached, enriched data stream is implemented tobe “replayed” by the cognitive engine 320. In one embodiment, thereplaying of the cached, enriched data stream allows incrementalingestion of the enriched data stream instead of ingesting the entireenriched data stream at one time. In various embodiments, one or moreenrichment agents 425 are implemented to be invoked by the enrichmentcomponent 423 to perform one or more enrichment operations described ingreater detail herein.

In various embodiments, the graph query engine 326 is implemented toreceive and process queries such that they can be bridged into acognitive graph, as described in greater detail herein, through the useof a bridging agent. In these embodiments, the graph query engine 326may include a query 426 component, a translate 427 component, a bridge428 component, and one or more bridging agents 429.

In various embodiments, the query 426 component is implemented tosupport natural language queries. In these and other embodiments, thequery 426 component receives queries, processes them (e.g., using NLPprocesses), and then maps the processed query to a target cognitivegraph. In various embodiments, the translate 427 component isimplemented to convert the processed queries provided by the query 426component into a form that can be used to query a target cognitivegraph. To further differentiate the distinction between thefunctionality respectively provided by the query 426 and translate 427components, the query 426 component is oriented toward understanding aquery from a user. In contrast, the translate 427 component is orientedto translating a query that is understood into a form that can be usedto query a cognitive graph.

In various embodiments, the bridge 428 component is implemented togenerate an answer to a query provided by the translate 427 component.In certain embodiments, the bridge 428 component is implemented toprovide domain-specific responses when bridging a translated query to acognitive graph. For example, the same query bridged to a targetcognitive graph by the bridge 428 component may result in differentanswers for different domains, dependent upon domain-specific bridgingoperations performed by the bridge 428 component.

To further differentiate the distinction between the translate 427component and the bridging 428 component, the translate 427 componentrelates to a general domain translation of a question. In contrast, thebridging 428 component allows the question to be asked in the context ofa specific domain (e.g., healthcare, travel, etc.), given what is knownabout the data. In certain embodiments, the bridging 428 component isimplemented to process what is known about the translated query, in thecontext of the user, to provide an answer that is relevant to a specificdomain.

As an example, a user may ask, “Where should I eat today?” If the userhas been prescribed a particular health regimen, the bridging 428component may suggest a restaurant with a “heart healthy” menu. However,if the user is a business traveler, the bridging 428 component maysuggest the nearest restaurant that has the user's favorite food. Invarious embodiments, the bridging 428 component may provide answers, orsuggestions, that are composed and ranked according to a specific domainof use. In various embodiments, the bridging agent 429 is implemented tointeract with the bridging component 428 to perform bridging operationsdescribed in greater detail herein. In these embodiments, the bridgingagent interprets a translated query generated by the query 426 componentwithin a predetermined user context, and then maps it to predeterminednodes and links within a target cognitive graph.

In various embodiments, the insight/learning engine 330 is implementedto encapsulate a predetermined algorithm, which is then applied to atarget cognitive graph to generate a result, such as a cognitive insightor a recommendation. In certain embodiments, one or more such algorithmsmay contribute to answering a specific question and provide additionalcognitive insights or recommendations. In these and other embodiments,the insight/learning engine 330 is implemented to performinsight/learning operations, described in greater detail herein. Invarious embodiments, the insight/learning engine 330 may include adiscover/visibility 430 component, a predict 431 component, arank/recommend 432 component, and one or more insight 433 agents.

In various embodiments, the discover/visibility 430 component isimplemented to provide detailed information related to a predeterminedtopic, such as a subject or an event, along with associated historicalinformation. In certain embodiments, the predict 431 component isimplemented to perform predictive operations to provide insight intowhat may next occur for a predetermined topic. In various embodiments,the rank/recommend 432 component is implemented to perform ranking andrecommendation operations to provide a user prioritized recommendationsassociated with a provided cognitive insight.

In certain embodiments, the insight/learning engine 330 may includeadditional components. For example the additional components may includeclassification algorithms, clustering algorithms, and so forth. Skilledpractitioners of the art will realize that many such additionalcomponents are possible and that the foregoing is not intended to limitthe spirit, scope or intent of the invention. In various embodiments,the insights agents 433 are implemented to create a visual data story,highlighting user-specific insights, relationships and recommendations.As a result, it can share, operationalize, or track business insights invarious embodiments. In various embodiments, the learning agent 434 workin the background to continually update the cognitive graph, asdescribed in greater detail herein, from each unique interaction withdata and users.

In various embodiments, the destination agents 336 are implemented topublish cognitive insights to a consumer of cognitive insight data.Examples of such consumers of cognitive insight data include targetdatabases, business intelligence applications, and mobile applications.In various embodiments, the destination agents 336 may include aHypertext Transfer Protocol (HTTP) stream 440 agent, an API connectors441 agent, a databases 442 agent, a message engines 443 agent, a mobilepush notification 444 agent, and one or more custom destination 446agents. Skilled practitioners of the art will realize that other typesof destination agents 318 may be used in various embodiments and theforegoing is not intended to limit the spirit, scope or intent of theinvention. In certain embodiments, each of the destination agents 318has a corresponding API.

In various embodiments, the HTTP stream 440 agent is implemented forproviding various HTTP streams of cognitive insight data to apredetermined cognitive data consumer. In these embodiments, theprovided HTTP streams may include various HTTP data elements familiar tothose of skill in the art. In certain embodiments, the HTTP streams ofdata are provided in near-real-time. In various embodiments, the APIconnectors 441 agent is implemented to manage interactions with one ormore predetermined APIs that are external to the cognitive platform 310.As an example, various target databases, business intelligenceapplications, and mobile applications may each have their own uniqueAPI.

In various embodiments, the databases 442 agent is implemented forprovision of cognitive insight data to one or more target databasesfamiliar to those of skill in the art. For example, the target databasemay include a SQL, NoSQL, delimited flat file, or other form ofdatabase. In these embodiments, the provided cognitive insight data mayinclude a single data element, a single data record or file, or aplurality of data records or files. In certain embodiments, the data maybe provided to more than one cognitive data consumer and the provideddata may be in a homogenous or heterogeneous form. In variousembodiments, the message engines 443 agent is implemented to providecognitive insight data to one or more message engines, such as a messagequeue (MQ) system, a message bus, a message broker, an enterpriseservice bus (ESB), and so forth. Skilled practitioners of the art willrealize that there are many such examples of message engines with whichthe message engines 443 agent may interact and the foregoing is notintended to limit the spirit, scope or intent of the invention.

In various embodiments, the custom destination agents 420, which arepurpose-built, are developed through the use of the developmentenvironment 314, described in greater detail herein. Examples of customdestination agents 420 include destination agents for various electronicmedical record (EMR) systems at various healthcare facilities. Such EMRsystems typically collect a variety of healthcare information, much ofit the same, yet it may be collected, stored and provided in differentways. In this example, the custom destination agents 420 allow such EMRsystems to receive cognitive insight data in a form they can use.

In various embodiments, data that has been cleansed, normalized andenriched by the dataset engine, as described in greater detail herein,is provided by a destination agent 336 to a predetermined destination,likewise described in greater detail herein. In these embodiments,neither the graph query engine 326 nor the insight/learning engine 330are implemented to perform their respective functions.

In various embodiments, the foundation components 334 are implemented toenable the dataset engine 322, the graph query engine 326, and theinsight/learning engine 330 to perform their respective operations andprocesses. In these and other embodiments, the foundation components 334may include an NLP core 436 component, an NLP services 437 component,and a dynamic pipeline engine 438. In various embodiments, the NLP core436 component is implemented to provide a set of predetermined NLPcomponents for performing various NLP operations described in greaterdetail herein.

In these embodiments, certain of these NLP core components are surfacedthrough the NLP services 437 component, while some are used aslibraries. Examples of operations that are performed with suchcomponents include dependency parsing, parts-of-speech tagging, sentencepattern detection, and so forth. In various embodiments, the NLPservices 437 component is implemented to provide various internal NLPservices, which are used to perform entity detection, summarization, andother operations, likewise described in greater detail herein. In theseembodiments, the NLP services 437 component is implemented to interactwith the NLP core 436 component to provide predetermined NLP services,such as summarizing a target paragraph.

In various embodiments, the dynamic pipeline engine 438 is implementedto interact with the dataset engine 322 to perform various operationsrelated to receiving one or more sets of data from one or more sourcingagents, apply enrichment to the data, and then provide the enriched datato a predetermined destination. In these and other embodiments, thedynamic pipeline engine 438 manages the distribution of these variousoperations to a predetermined compute cluster and tracks versioning ofthe data as it is processed across various distributed computingresources. In certain embodiments, the dynamic pipeline engine 438 isimplemented to perform data sovereignty management operations tomaintain sovereignty of the data.

In various embodiments, the platform data 338 includes various datarepositories, described in greater detail herein, that are accessed bythe cognitive platform 310 to generate cognitive insights. In theseembodiments, the platform data 338 repositories may include repositoriesof dataset metadata 456, cognitive graphs 457, models 459, crawl data460, and management metadata 461. In various embodiments, the datasetmetadata 456 is associated with curated data 458 contained in therepository of cognitive graphs 457. In these and other embodiments, therepository of dataset metadata 456 contains dataset metadata thatsupports operations performed by the storage 424 component of thedataset engine 322. For example, if a Mongo® NoSQL database with tenmillion items is being processed, and the cognitive platform 310 failsafter ingesting nine million of the items, then the dataset metadata 456may be able to provide a checkpoint that allows ingestion to continue atthe point of failure instead restarting the ingestion process.

Those of skill in the art will realize that the use of such datasetmetadata 456 in various embodiments allows the dataset engine 322 to bestateful. In certain embodiments, the dataset metadata 456 allowssupport of versioning. For example versioning may be used to trackversions of modifications made to data, such as in data enrichmentprocesses described in greater detail herein. As another example,geotagging information may have been applied to a set of data during afirst enrichment process, which creates a first version of enricheddata. Adding sentiment data to the same million records during a secondenrichment process creates a second version of enriched data. In thisexample, the dataset metadata stored in the dataset metadata 456provides tracking of the different versions of the enriched data and thedifferences between the two.

In various embodiments, the repository of cognitive graphs 457 isimplemented to store cognitive graphs generated, accessed, and updatedby the cognitive engine 320 in the process of generating cognitiveinsights. In various embodiments, the repository of cognitive graphs 457may include one or more repositories of curated data 458, described ingreater detail herein. In certain embodiments, the repositories ofcurated data 458 includes data that has been curated by one or moreusers, machine operations, or a combination of the two, by performingvarious sourcing, filtering, and enriching operations described ingreater detail herein. In these and other embodiments, the curated data458 is ingested by the cognitive platform 310 and then processed, aslikewise described in greater detail herein, to generate cognitiveinsights. In various embodiments, the repository of models 459 isimplemented to store models that are generated, accessed, and updated bythe cognitive engine 320 in the process of generating cognitiveinsights. As used herein, models broadly refer to machine learningmodels. In certain embodiments, the models include one or morestatistical models.

In various embodiments, the crawl framework 452 is implemented tosupport various crawlers 454 familiar to skilled practitioners of theart. In certain embodiments, the crawlers 454 are custom configured forvarious target domains. For example, different crawlers 454 may be usedfor various travel forums, travel blogs, travel news and other travelsites. In various embodiments, data collected by the crawlers 454 isprovided by the crawl framework 452 to the repository of crawl data 460.In these embodiments, the collected crawl data is processed and thenstored in a normalized form in the repository of crawl data 460. Thenormalized data is then provided to SQL/NoSQL database 417 agent, whichin turn provides it to the dataset engine 322. In one embodiment, thecrawl database 460 is a NoSQL database, such as Mongo®.

In various embodiments, the repository of management metadata 461 isimplemented to store user-specific metadata used by the managementconsole 312 to manage accounts (e.g., billing information) and projects.In certain embodiments, the user-specific metadata stored in therepository of management metadata 461 is used by the management console312 to drive processes and operations within the cognitive platform 310for a predetermined project. In various embodiments, the user-specificmetadata stored in the repository of management metadata 461 is used toenforce data sovereignty. It will be appreciated that many suchembodiments are possible and the foregoing is not intended to limit thespirit, scope or intent of the invention.

Referring now to FIG. 4 c, the cloud infrastructure 340 may include acognitive cloud management 342 component and a cloud analyticsinfrastructure 344 component in various embodiments. Current examples ofa cloud infrastructure 340 include Amazon Web Services (AWS®), availablefrom Amazon.com® of Seattle, Wash., IBM® Softlayer, available fromInternational Business Machines of Armonk, N.Y., and Nebula/Openstack, ajoint project between Rackspace Hosting®, of Windcrest, Tex., and theNational Aeronautics and Space Administration (NASA). In theseembodiments, the cognitive cloud management 342 component may include amanagement playbooks 468 sub-component, a cognitive cloud managementconsole 469 sub-component, a data console 470 sub-component, an assetrepository 471 sub-component. In certain embodiments, the cognitivecloud management 342 component may include various other sub-components.

In various embodiments, the management playbooks 468 sub-component isimplemented to automate the creation and management of the cloudanalytics infrastructure 344 component along with various otheroperations and processes related to the cloud infrastructure 340. Asused herein, “management playbooks” broadly refers to any set ofinstructions or data, such as scripts and configuration data, that isimplemented by the management playbooks 468 sub-component to perform itsassociated operations and processes.

In various embodiments, the cognitive cloud management console 469sub-component is implemented to provide a user visibility and managementcontrols related to the cloud analytics infrastructure 344 componentalong with various other operations and processes related to the cloudinfrastructure 340. In various embodiments, the data console 470sub-component is implemented to manage platform data 338, described ingreater detail herein. In various embodiments, the asset repository 471sub-component is implemented to provide access to various cognitivecloud infrastructure assets, such as asset configurations, machineimages, and cognitive insight stack configurations.

In various embodiments, the cloud analytics infrastructure 344 componentmay include a data grid 472 sub-component, a distributed compute engine474 sub-component, and a compute cluster management 476 sub-component.In these embodiments, the cloud analytics infrastructure 344 componentmay also include a distributed object storage 478 sub-component, adistributed full text search 480 sub-component, a document database 482sub-component, a graph database 484 sub-component, and various othersub-components. In various embodiments, the data grid 472 sub-componentis implemented to provide distributed and shared memory that allows thesharing of objects across various data structures. One example of a datagrid 472 sub-component is Redis, an open-source, networked, in-memory,key-value data store, with optional durability, written in ANSI C. Invarious embodiments, the distributed compute engine 474 sub-component isimplemented to allow the cognitive platform 310 to perform variouscognitive insight operations and processes in a distributed computingenvironment. Examples of such cognitive insight operations and processesinclude batch operations and streaming analytics processes.

In various embodiments, the compute cluster management 476 sub-componentis implemented to manage various computing resources as a computecluster. One such example of such a compute cluster management 476sub-component is Mesos/Nimbus, a cluster management platform thatmanages distributed hardware resources into a single pool of resourcesthat can be used by application frameworks to efficiently manageworkload distribution for both batch jobs and long-running services. Invarious embodiments, the distributed object storage 478 sub-component isimplemented to manage the physical storage and retrieval of distributedobjects (e.g., binary file, image, text, etc.) in a cloud environment.Examples of a distributed object storage 478 sub-component includeAmazon S3®, available from Amazon.com of Seattle, Wash., and Swift, anopen source, scalable and redundant storage system.

In various embodiments, the distributed full text search 480sub-component is implemented to perform various full text searchoperations familiar to those of skill in the art within a cloudenvironment. In various embodiments, the document database 482sub-component is implemented to manage the physical storage andretrieval of structured data in a cloud environment. Examples of suchstructured data include social, public, private, and device data, asdescribed in greater detail herein. In certain embodiments, thestructured data includes data that is implemented in the JavaScriptObject Notation (JSON) format. One example of a document database 482sub-component is Mongo, an open source cross-platform document-orienteddatabase. In various embodiments, the graph database 484 sub-componentis implemented to manage the physical storage and retrieval of cognitivegraphs. One example of a graph database 484 sub-component is GraphDB, anopen source graph database familiar to those of skill in the art.

FIG. 5 is a simplified process diagram of cognitive inference andlearning system (CILS) operations performed in accordance with anembodiment of the invention. In various embodiments, these CILSoperations may include a perceive 506 phase, a relate 508 phase, anoperate 510 phase, a process and execute 512 phase, and a learn 514phase. In these and other embodiments, the CILS 118 shown in FIG. 2 isimplemented to mimic cognitive processes associated with the humanbrain. In various embodiments, the CILS operations are performed throughthe implementation of a cognitive platform 310, described in greaterdetail herein. In these and other embodiments, the cognitive platform310 may be implemented within a cloud analytics infrastructure 344,which in turn is implemented within a cloud infrastructure 340, likewisedescribed in greater detail herein.

In various embodiments, multi-site, multi-structured source streams 504are provided by sourcing agents, as described in greater detail herein.In these embodiments, the source streams 504 are dynamically ingested inreal-time during the perceive 506 phase, and based upon a predeterminedcontext, extraction, parsing, and tagging operations are performed onlanguage, text and images contained in the source streams 504. Automaticfeature extraction and modeling operations are then performed with thepreviously processed source streams 504 during the relate 508 phase togenerate queries to identify related data (i.e., corpus expansion).

In various embodiments, operations are performed during the operate 510phase to discover, summarize and prioritize various concepts, which arein turn used to generate actionable recommendations and notificationsassociated with predetermined plan-based optimization goals. Theresulting actionable recommendations and notifications are thenprocessed during the process and execute 512 phase to provide cognitiveinsights, such as recommendations, to various predetermined destinationsand associated application programming interfaces (APIs) 524.

In various embodiments, features from newly-observed data areautomatically extracted from user feedback during the learn 514 phase toimprove various analytical models. In these embodiments, the learn 514phase includes feedback on observations generated during the relate 508phase, which is provided to the perceive 506 phase. Likewise, feedbackon decisions resulting from operations performed during the operate 510phase, and feedback on results resulting from operations performedduring the process and execute 512 phase, are also provided to theperceive 506 phase.

In various embodiments, user interactions result from operationsperformed during the process and execute 512 phase. In theseembodiments, data associated with the user interactions are provided tothe perceive 506 phase as unfolding interactions 522, which includeevents that occur external to the CILS operations described in greaterdetail herein. As an example, a first query from a user may be submittedto the CILS system, which in turn generates a first cognitive insight,which is then provided to the user. In response, the user may respond byproviding a first response, or perhaps a second query, either of whichis provided in the same context as the first query. The CILS receivesthe first response or second query, performs various CILS operations,and provides the user a second cognitive insight. As before, the usermay respond with a second response or a third query, again in thecontext of the first query. Once again, the CILS performs various CILSoperations and provides the user a third cognitive insight, and soforth. In this example, the provision of cognitive insights to the user,and their various associated responses, results in unfoldinginteractions 522, which in turn result in a stateful dialog that evolvesover time. Skilled practitioners of the art will likewise realize thatsuch unfolding interactions 522, occur outside of the CILS operationsperformed by the cognitive platform 310.

FIG. 6 depicts the lifecycle of CILS agents implemented in accordancewith an embodiment of the invention to perform CILS operations. Invarious embodiments, the CILS agents lifecycle 602 may includeimplementation of a sourcing 318 agent, an enrichment 425 agent, abridging 429 agent, an insight 433 agent, a destination 336 agent, and alearning 434 agent. In these embodiments, the sourcing 318 agent isimplemented to source a variety of multi-site, multi-structured sourcestreams of data described in greater detail herein. These sourced datastreams are then provided to an enrichment 425 agent, which then invokesan enrichment component to perform enrichment operations to generateenriched data streams, likewise described in greater detail herein.

The enriched data streams are then provided to a bridging 429 agent,which is used to perform bridging operations described in greater detailherein. In turn, the results of the bridging operations are provided toan insight 433 agent, which is implemented as described in greaterdetail herein to create a visual data story, highlighting user-specificinsights, relationships and recommendations. The resulting visual datastory is then provided to a destination 336 agent, which is implementedto publish cognitive insights to a consumer of cognitive insight data,likewise as described in greater detail herein. In response, theconsumer of cognitive insight data provides feedback to a learning 434agent, which is implemented as described in greater detail herein toprovide the feedback to the sourcing agent 318, at which point the CILSagents lifecycle 602 is continued. From the foregoing, skilledpractitioners of the art will recognize that each iteration of thecognitive agents lifecycle 602 provides more informed cognitiveinsights.

FIG. 7 is a simplified block diagram of a plurality of cognitiveplatforms implemented in accordance with an embodiment of the inventionwithin a hybrid cloud infrastructure. In this embodiment, the hybridcloud infrastructure 740 includes a cognitive cloud management 342component, a hosted 704 cognitive cloud environment, and a private 706network environment. As shown in FIG. 7, the hosted 704 cognitive cloudenvironment includes a hosted 710 cognitive platform, such as thecognitive platform 310 shown in FIGS. 3, 4 a, and 4 b. In variousembodiments, the hosted 704 cognitive cloud environment may also includea hosted 718 universal knowledge repository and one or more repositoriesof curated public data 714 and licensed data 716. Likewise, the hosted710 cognitive platform may also include a hosted 712 analyticsinfrastructure, such as the cloud analytics infrastructure 344 shown inFIGS. 3 and 4 c.

As likewise shown in FIG. 7, the private 706 network environmentincludes a private 720 cognitive platform, such as the cognitiveplatform 310 shown in FIGS. 3, 4 a, and 4 b. In various embodiments, theprivate 706 network cognitive cloud environment may also include aprivate 728 universal knowledge repository and one or more repositoriesof application data 724 and private data 726. Likewise, the private 720cognitive platform may also include a private 722 analyticsinfrastructure, such as the cloud analytics infrastructure 344 shown inFIGS. 3 and 4 c. In certain embodiments, the private 706 networkenvironment may have one or more private 736 cognitive applicationsimplemented to interact with the private 720 cognitive platform.

As used herein, a universal knowledge repository broadly refers to acollection of knowledge elements that can be used in various embodimentsto generate one or more cognitive insights described in greater detailherein. In various embodiments, these knowledge elements may includefacts (e.g., milk is a dairy product), information (e.g., an answer to aquestion), descriptions (e.g., the color of an automobile), skills(e.g., the ability to install plumbing fixtures), and other classes ofknowledge familiar to those of skill in the art. In these embodiments,the knowledge elements may be explicit or implicit. As an example, thefact that water freezes at zero degrees centigrade would be an explicitknowledge element, while the fact that an automobile mechanic knows howto repair an automobile would be an implicit knowledge element.

In certain embodiments, the knowledge elements within a universalknowledge repository may also include statements, assertions, beliefs,perceptions, preferences, sentiments, attitudes or opinions associatedwith a person or a group. As an example, user ‘A’ may prefer the pizzaserved by a first restaurant, while user ‘B’ may prefer the pizza servedby a second restaurant. Furthermore, both user ‘A’ and ‘B’ are firmly ofthe opinion that the first and second restaurants respectively serve thevery best pizza available. In this example, the respective preferencesand opinions of users ‘A’ and B′ regarding the first and secondrestaurant may be included in the universal knowledge repository 880 asthey are not contradictory. Instead, they are simply knowledge elementsrespectively associated with the two users and can be used in variousembodiments for the generation of various cognitive insights, asdescribed in greater detail herein.

In various embodiments, individual knowledge elements respectivelyassociated with the hosted 718 and private 728 universal knowledgerepositories may be distributed. In one embodiment, the distributedknowledge elements may be stored in a plurality of data stores familiarto skilled practitioners of the art. In this embodiment, the distributedknowledge elements may be logically unified for various implementationsof the hosted 718 and private 728 universal knowledge repositories. Incertain embodiments, the hosted 718 and private 728 universal knowledgerepositories may be respectively implemented in the form of a hosted orprivate universal cognitive graph. In these embodiments, nodes withinthe hosted or private universal graph contain one or more knowledgeelements.

In various embodiments, a secure tunnel 730, such as a virtual privatenetwork (VPN) tunnel, is implemented to allow the hosted 710 cognitiveplatform and the private 720 cognitive platform to communicate with oneanother. In these various embodiments, the ability to communicate withone another allows the hosted 710 and private 720 cognitive platforms towork collaboratively when generating cognitive insights described ingreater detail herein. In various embodiments, the hosted 710 cognitiveplatform accesses knowledge elements stored in the hosted 718 universalknowledge repository and data stored in the repositories of curatedpublic data 714 and licensed data 716 to generate various cognitiveinsights. In certain embodiments, the resulting cognitive insights arethen provided to the private 720 cognitive platform, which in turnprovides them to the one or more private cognitive applications 736.

In various embodiments, the private 720 cognitive platform accessesknowledge elements stored in the private 728 universal knowledgerepository and data stored in the repositories of application data 724and private data 726 to generate various cognitive insights. In turn,the resulting cognitive insights are then provided to the one or moreprivate cognitive applications 736. In certain embodiments, the private720 cognitive platform accesses knowledge elements stored in the hosted718 and private 728 universal knowledge repositories and data stored inthe repositories of curated public data 714, licensed data 716,application data 724 and private data 726 to generate various cognitiveinsights. In these embodiments, the resulting cognitive insights are inturn provided to the one or more private cognitive applications 736.

In various embodiments, the secure tunnel 730 is implemented for thehosted 710 cognitive platform to provide 732 predetermined data andknowledge elements to the private 720 cognitive platform. In oneembodiment, the provision 732 of predetermined knowledge elements allowsthe hosted 718 universal knowledge repository to be replicated as theprivate 728 universal knowledge repository. In another embodiment, theprovision 732 of predetermined knowledge elements allows the hosted 718universal knowledge repository to provide updates 734 to the private 728universal knowledge repository. In certain embodiments, the updates 734to the private 728 universal knowledge repository do not overwrite otherdata. Instead, the updates 734 are simply added to the private 728universal knowledge repository.

In one embodiment, knowledge elements that are added to the private 728universal knowledge repository are not provided to the hosted 718universal knowledge repository. As an example, an airline may not wishto share private information related to its customer's flights, theprice paid for tickets, their awards program status, and so forth. Inanother embodiment, predetermined knowledge elements that are added tothe private 728 universal knowledge repository may be provided to thehosted 718 universal knowledge repository. As an example, the operatorof the private 720 cognitive platform may decide to licensepredetermined knowledge elements stored in the private 728 universalknowledge repository to the operator of the hosted 710 cognitiveplatform. To continue the example, certain knowledge elements stored inthe private 728 universal knowledge repository may be anonymized priorto being provided for inclusion in the hosted 718 universal knowledgerepository. In one embodiment, only private knowledge elements arestored in the private 728 universal knowledge repository. In thisembodiment, the private 720 cognitive platform may use knowledgeelements stored in both the hosted 718 and private 728 universalknowledge repositories to generate cognitive insights. Skilledpractitioners of the art will recognize that many such embodiments arepossible and the foregoing is not intended to limit the spirit, scope orintent of the invention.

FIG. 8 is a simplified process flow diagram of composite cognitiveinsight generation and feedback operations performed in accordance withan embodiment of the invention. As used herein, a composite cognitiveinsight broadly refers to a set of cognitive insights generated as aresult of orchestrating a predetermined set of independent cognitiveagents, referred to herein as insight agents. In various embodiments,the insight agents use a cognitive graph, such as an applicationcognitive graph 882 as their data source to respectively generateindividual cognitive insights. As used herein, an application cognitivegraph 882 broadly refers to a cognitive graph that is associated with apredetermined cognitive application 304. In certain embodiments,different cognitive applications 304 may interact with differentapplication cognitive graphs 882 to generate individual cognitiveinsights for a user. In various embodiments, the resulting individualcognitive insights are then composed to generate a set of cognitivecomposite insights, which in turn is provided to a user in the form of acognitive insight summary 848.

In various embodiments, the orchestration of the selected insight agentsis performed by the cognitive insight/learning engine 330 shown in FIGS.3 and 4 a. In certain embodiments, a predetermined subset of insightagents is selected to provide composite cognitive insights to satisfy agraph query 844, a contextual situation, or some combination thereof.For example, it may be determined, as described in greater detailherein, that a particular subset of insight agents may be suited toprovide a composite cognitive insight related to a particular user of aparticular device, at a particular location, at a particular time, for aparticular purpose.

In certain embodiments, the insight agents are selected fororchestration as a result of receiving direct or indirect input data 842from a user. In various embodiments, the direct user input may be anatural language inquiry. In certain embodiments, the indirect userinput data 842 may include the location of a user's device or thepurpose for which it is being used. As an example, the GeographicalPositioning System (GPS) coordinates of the location of a user's mobiledevice may be received as indirect user input data 842. As anotherexample, a user may be using the integrated camera of their mobiledevice to take a photograph of a location, such as a restaurant, or anitem, such as a food product. In certain embodiments, the direct orindirect user input data 842 may include personal information that canbe used to identify the user. Skilled practitioners of the art willrecognize that many such embodiments are possible and the foregoing isnot intended to limit the spirit, scope or intent of the invention.

In various embodiments, composite cognitive insight generation andfeedback operations may be performed in various phases. In thisembodiment, these phases include a data lifecycle 840 phase, a learning838 phase, and an application/insight composition 840 phase. In the datalifecycle 836 phase, a predetermined instantiation of a cognitiveplatform 810 sources social data 812, public data 814, licensed data816, and proprietary data 818 from various sources as described ingreater detail herein. In various embodiments, an example of a cognitiveplatform 810 instantiation is the cognitive platform 310 shown in FIGS.3, 4 a, and 4 b. In this embodiment, the instantiation of a cognitiveplatform 810 includes a source 806 component, a process 808 component, adeliver 810 component, a cleanse 820 component, an enrich 822 component,a filter/transform 824 component, and a repair/reject 826 component.Likewise, as shown in FIG. 8, the process 808 component includes arepository of models 828, described in greater detail herein.

In various embodiments, the process 808 component is implemented toperform various composite insight generation and other processingoperations described in greater detail herein. In these embodiments, theprocess 808 component is implemented to interact with the source 806component, which in turn is implemented to perform various data sourcingoperations described in greater detail herein. In various embodiments,the sourcing operations are performed by one or more sourcing agents, aslikewise described in greater detail herein. The resulting sourced datais then provided to the process 808 component. In turn, the process 808component is implemented to interact with the cleanse 820 component,which is implemented to perform various data cleansing operationsfamiliar to those of skill in the art. As an example, the cleanse 820component may perform data normalization or pruning operations, likewiseknown to skilled practitioners of the art. In certain embodiments, thecleanse 820 component may be implemented to interact with therepair/reject 826 component, which in turn is implemented to performvarious data repair or data rejection operations known to those of skillin the art.

Once data cleansing, repair and rejection operations are completed, theprocess 808 component is implemented to interact with the enrich 822component, which is implemented in various embodiments to performvarious data enrichment operations described in greater detail herein.Once data enrichment operations have been completed, the process 808component is likewise implemented to interact with the filter/transform824 component, which in turn is implemented to perform data filteringand transformation operations described in greater detail herein.

In various embodiments, the process 808 component is implemented togenerate various models, described in greater detail herein, which arestored in the repository of models 828. The process 808 component islikewise implemented in various embodiments to use the sourced data togenerate one or more cognitive graphs, such as an application cognitivegraph 882, as described in greater detail herein. In variousembodiments, the process 808 component is implemented to gain anunderstanding of the data sourced from the sources of social data 812,public data 814, licensed data 816, and proprietary data 818, whichassist in the automated generation of the application cognitive graph882.

The process 808 component is likewise implemented in various embodimentsto perform bridging 846 operations, described in greater detail herein,to access the application cognitive graph 882. In certain embodiments,the bridging 846 operations are performed by bridging agents, likewisedescribed in greater detail herein. In various embodiments, theapplication cognitive graph 882 is accessed by the process 808 componentduring the learning 836 phase of the composite cognitive insightgeneration operations.

In various embodiments, a cognitive application 304 is implemented toreceive input data 842 associated with an individual user or a group ofusers. In these embodiments, the input data 842 may be direct, such as auser query or mouse click, or indirect, such as the current time orGeographical Positioning System (GPS) data received from a mobile deviceassociated with a user. In various embodiments, the indirect input data842 may include contextual data, described in greater detail herein.Once it is received, the input data 842 is then submitted by thecognitive application 304 to a graph query engine 326 during theapplication/insight composition 840 phase. In turn, the graph queryengine 326 processes the submitted input data 842 to generate a graphquery 844, as described in greater detail herein. The graph query 844 isthen used to query the application cognitive graph 882, which results inthe generation of one or more composite cognitive insights, likewisedescribed in greater detail herein. In certain embodiments, the graphquery 844 uses predetermined knowledge elements stored in the universalknowledge repository 880 when querying the application cognitive graph882 to generate the one or more composite insights.

In various embodiments, a set of contextually-related interactionsbetween a cognitive application 304 and the application cognitive graph882 are represented as a corresponding set of nodes in a predeterminedcognitive session graph, which is then stored in a repository ofcognitive session graphs ‘1’ through ‘n’ 852. As used herein, acognitive session graph broadly refers to a cognitive graph whose nodesare associated with a cognitive session. As used herein, a cognitivesession broadly refers to a predetermined user, group of users, theme,topic, issue, question, intent, goal, objective, task, assignment,process, situation, requirement, condition, responsibility, location,period of time, or any combination thereof.

As an example, the application cognitive graph 882 may be unaware of aparticular user's preferences, which are likely stored in acorresponding user profile. To further the example, a user may typicallychoose a particular brand or manufacturer when shopping for a given typeof product, such as cookware. A record of each query regarding thatbrand of cookware, or its selection, is iteratively stored in apredetermined cognitive session graph that is associated with the userand stored in a repository of cognitive session graphs ‘1’ through ‘n’852. As a result, the preference of that brand of cookware is rankedhigher, and is presented in response to contextually-related queries,even when the preferred brand of cookware is not explicitly referencedby the user. To continue the example, the user may make a number ofqueries over a period of days or weeks, yet the queries are allassociated with the same cognitive session graph that is associated withthe user and stored in a repository of cognitive session graphs ‘1’through ‘n’ 852, regardless of when each query is made.

As another example, a user queries a cognitive application 304 duringbusiness hours to locate an upscale restaurant located close their placeof business. As a result, a first cognitive session graph stored in arepository of cognitive session graphs ‘1’ through ‘n’ 852 is associatedwith the user's query, which results in the provision of compositecognitive insights related to restaurants suitable for businessmeetings. To continue the example, the same user queries the samecognitive application 304 during the weekend to locate a casualrestaurant located close to their home. As a result, a second cognitivesession graph stored in a repository of cognitive session graphs ‘1’through ‘n’ 852 is associated with the user's query, which results inthe provision of composite cognitive insights related to restaurantssuitable for family meals. In these examples, the first and secondcognitive session graphs are both associated with the same user, but fortwo different purposes, which results in the provision of two differentsets of composite cognitive insights.

As yet another example, a group of customer support representatives istasked with resolving technical issues customers may have with apredetermined product. In this example, the product and the group ofcustomer support representatives are collectively associated with apredetermined cognitive session graph stored in a repository ofcognitive session graphs ‘1’ through ‘n’ 852. To continue the example,individual customer support representatives may submit queries relatedto the product to a cognitive application 304, such as a knowledge baseapplication. In response, a predetermined cognitive session graph storedin a repository of cognitive session graphs ‘1’ through ‘n’ 852 is used,along with the universal knowledge repository 880 and applicationcognitive graph 882, to generate individual or composite cognitiveinsights to resolve a technical issue for a customer. In this example,the cognitive application 304 may be queried by the individual customersupport representatives at different times during some predeterminedtime interval, yet the same cognitive session graph stored in arepository of cognitive session graphs ‘1’ through ‘n’ 852 is used togenerate composite cognitive insights related to the product.

In various embodiments, each cognitive session graph associated with auser and stored in a repository of cognitive session graphs ‘1’ through‘n’ 852 includes one or more direct or indirect user queries representedas nodes, and the time at which they were asked, which are in turnlinked 854 to nodes that appear in the application cognitive graph 882.In certain embodiments, each individual session graph that is associatedwith the user and stored in a repository of cognitive session graphs ‘1’through ‘n’ 852 introduces edges that are not already present in theapplication cognitive graph 882. More specifically, each of the sessiongraphs that is associated with the user and stored in a repository ofcognitive session graphs ‘1’ through ‘n’ 852 establishes variousrelationships that the application cognitive graph 882 does not alreadyhave.

In various embodiments, individual graph queries 844 associated with apredetermined session graph stored in a repository of cognitive sessiongraphs ‘1’ through ‘n’ 852 are likewise provided to predeterminedinsight agents to perform various kinds of analyses. In certainembodiments, each insight agent performs a different kind of analysis.In various embodiments, different insight agents may perform the same,or similar, analyses. In certain embodiments, different agentsperforming the same or similar analyses may be competing betweenthemselves.

For example, a user may be a realtor that has a young, uppermiddle-class, urban-oriented clientele that typically enjoys eating attrendy restaurants that are in walking distance of where they live. As aresult, the realtor may be interested in knowing about new or popularrestaurants that are in walking distance of their property listings thathave a young, middle-class clientele. In this example, the user'squeries may result the assignment of predetermined insight agents toperform analysis of various social media interactions to identify suchrestaurants that have received favorable reviews. To continue theexample, the resulting composite insights may be provided as a rankedlist of candidate restaurants that may be suitable venues for therealtor to meet his clients.

In various embodiments, the process 808 component is implemented toprovide these composite cognitive insights to the deliver 810 component,which in turn is implemented to deliver the composite cognitive insightsin the form of a cognitive insight summary 848 to the cognitiveapplication 304. In these embodiments, the cognitive platform 810 isimplemented to interact with an insight front-end 856 component, whichprovides a composite insight and feedback interface with the cognitiveapplication 304. In certain embodiments, the insight front-end 856component includes an insight Application Program Interface (API) 858and a feedback API 860, described in greater detail herein. In theseembodiments, the insight API 858 is implemented to convey the cognitiveinsight summary 848 to the cognitive application 304. Likewise, thefeedback API 860 is used to convey associated direct or indirect userfeedback 862 to the cognitive platform 810. In certain embodiments, thefeedback API 860 provides the direct or indirect user feedback 862 tothe repository of models 828 described in greater detail herein.

To continue the preceding example, the user may have received a list ofcandidate restaurants that may be suitable venues for meeting hisclients. However, one of his clients has a pet that they like to takewith them wherever they go. As a result, the user provides feedback 862that he is looking for a restaurant that is pet-friendly. The providedfeedback 862 is in turn provided to the insight agents to identifycandidate restaurants that are also pet-friendly. In this example, thefeedback 862 is stored in the appropriate cognitive session graph storedin a repository of cognitive session graphs ‘1’ through ‘n’ 852associated with the user and their original query.

In various embodiments, as described in the descriptive text associatedwith FIG. 5, learning operations are iteratively performed during thelearning 838 phase to provide more accurate and useful compositecognitive insights. In certain of these embodiments, feedback 862received from the user is stored in a predetermined cognitive sessiongraph that is associated with the user and stored in a repository ofcognitive session graphs ‘1’ through ‘n’ 852, which is then used toprovide more accurate composite cognitive insights in response tosubsequent contextually-relevant queries from the user.

As an example, composite cognitive insights provided by a particularinsight agent related to a first subject may not be relevant orparticularly useful to a user of the cognitive application 304. As aresult, the user provides feedback 862 to that effect, which in turn isstored in the appropriate cognitive session graph that is associatedwith the user and stored in a repository of cognitive session graphs ‘1’through ‘n’ 852. Accordingly, subsequent insights provided by theinsight agent related the first subject may be ranked lower, or notprovided, within a cognitive insight summary 848 to the user.Conversely, the same insight agent may provide excellent insightsrelated to a second subject, resulting in positive feedback 862 beingreceived from the user. The positive feedback 862 is likewise stored inthe appropriate cognitive session graph that is associated with the userand stored in a repository of cognitive session graphs ‘1’ through ‘n’852. As a result, subsequent insights provided by the insight agentrelated to the second subject may be ranked higher within a cognitiveinsight summary 848 provided to the user.

In various embodiments, the composite insights provided in eachcognitive insight summary 848 to the cognitive application 304, andcorresponding feedback 862 received from a user in return, is providedin the form of one or more insight streams 864 to an associatedcognitive session graph stored in a repository of cognitive sessiongraphs ‘1’ through ‘n’ 852. In these and other embodiments, the insightstreams 864 may contain information related to the user of the cognitiveapplication 304, the time and date of the provided composite cognitiveinsights and related feedback 862, the location of the user, and thedevice used by the user.

As an example, a query related to upcoming activities that is receivedat 10:00 AM on a Saturday morning from a user's home may returncomposite cognitive insights related to entertainment performancesscheduled for the weekend. Conversely, the same query received at thesame time on a Monday morning from a user's office may return compositeinsights related to business functions scheduled during the work week.In various embodiments, the information contained in the insight streams864 is used to rank the composite cognitive insights provided in thecognitive insight summary 848. In certain embodiments, the compositecognitive insights are continually re-ranked as additional insightstreams 864 are received. Skilled practitioners of the art willrecognize that many such embodiments are possible and the foregoing isnot intended to limit the spirit, scope or intent of the invention.

FIG. 9 is a generalized flowchart of composite cognitive insight andfeedback operations performed in accordance with an embodiment of theinvention. In this embodiment, composite cognitive insight and feedbackoperations are begun in step 902, followed by a cognitive applicationrequesting an Application Programming Interface (API) key from acognitive platform, described in greater detail herein, in step 906. Themethod by which the API key is requested, generated and provided to thecognitive application is a matter of design choice.

A cognitive session token is then issued to the cognitive application instep 906, followed by a cognitive session token being returned to thecognitive application in step 908 to establish a composite cognitiveinsight session. As used herein, a composite cognitive insight sessionbroadly refers to a session with a cognitive application, described ingreater detail herein, where composite cognitive insights are generatedand presented to a user. In various embodiments, the composite cognitiveinsight session may also include the receipt of feedback from the user,described in greater detail herein. In one embodiment, the cognitivesession token is used to establish a composite cognitive insight sessionthat generates a new cognitive session graph. In another embodiment, thecognitive session token is used to establish a composite cognitiveinsight session that appends composite cognitive insights and userfeedback to an existing cognitive session graph associated with theuser.

In various embodiments, the cognitive session token enables thecognitive application to interact with a cognitive session graphassociated with the cognitive session token. In these embodiments, thecomposite cognitive insight session is perpetuated. For example, a givencomposite cognitive insight session may last months or even years. Incertain embodiments, the cognitive session token expires after apredetermined period of time. In these embodiments, the cognitivesession token is no longer valid once the predetermined period of timeexpires. The method by which the period of time is determined, andmonitored, is a matter of design choice.

Contextually-relevant composite cognitive insights are then generatedand presented, as described in greater detail herein, to the user instep 910. In various embodiments, composite cognitive insights presentedto the user are stored in the cognitive session graph associated withthe cognitive session token. A determination is then made in step 912 iffeedback is received from the user. If not, then a determination is madein step 924 whether to end composite cognitive insights and feedbackoperations. If not, then a determination is made in step 926 whether thecognitive session token for the target session has expired. If not, thenthe process is continued, proceeding with step 910. Otherwise, theprocess is continued, proceeding with step 904. However, if it isdetermined in step 924 not to end composite cognitive insights andfeedback operations, then composite cognitive insights and feedbackoperations are ended in step 928.

However, if it was determined in step 912 that feedback was receivedfrom the user then the cognitive application provides the feedback, asdescribed in greater detail herein, to the cognitive platform in step914. A determination is then made in step 916 whether to use theprovided feedback to generate contextually-relevant questions forprovision to the user. If not, then the process is continued, proceedingwith step 924. Otherwise, the cognitive platform uses the providedfeedback in step 918 to generate contextually-relevant questions. Then,in step 922, the cognitive platform provides the contextually-relevantquestions, along with additional composite insights, to the cognitiveapplication. In turn, the cognitive application provides thecontextually-relevant questions and additional composite cognitiveinsights to the user in step 922 and the process is continued,proceeding with step 912.

Although the present invention has been described in detail, it shouldbe understood that various changes, substitutions and alterations can bemade hereto without departing from the spirit and scope of the inventionas defined by the appended claims.

What is claimed is:
 1. A system comprising: a processor; a data buscoupled to the processor; and a non-transitory, computer-readablestorage medium embodying computer program code, the non-transitory,computer-readable storage medium being coupled to the data bus, thecomputer program code interacting with a plurality of computeroperations and comprising instructions executable by the processor andconfigured for: receiving streams of data from a plurality of datasources; processing the streams of data from the plurality of datasources, the processing the streams of data from the plurality of datasources performing data enriching to provide data enriched data streams;and, storing the data streams and the data enriched data streams withinthe universal knowledge repository as a collection of knowledgeelements.
 2. The system of claim 1, wherein the instructions executableby the processor further comprise instructions for: generating acognitive insight based upon the collection of knowledge elements storedwithin the universal knowledge repository.
 3. The system of claim 1,wherein: the data enriching comprises identifying at least someknowledge elements within the collection of knowledge elements as atleast one of facts, opinions, descriptions, and skills.
 4. The system ofclaim 1, wherein: the data enriching comprises identifying at least someknowledge elements within the collection of knowledge elements as atleast one of statements, assertions, beliefs, perceptions, preferences,sentiments, attitudes and opinions and associating identified at leastsome knowledge elements with an entity responsible for generating the atleast one of statements, assertions, beliefs, perceptions, preferences,sentiments, attitudes and opinions.
 5. The system of claim 1, wherein:the universal knowledge repository comprises a private knowledgerepository; and, at least some of the plurality of data streams areprivate data streams.
 6. The system of claim 1, wherein: the universalknowledge repository comprises a universal cognitive graph.
 7. Anon-transitory, computer-readable storage medium embodying computerprogram code, the computer program code comprising computer executableinstructions configured for: receiving streams of data from a pluralityof data sources; processing the streams of data from the plurality ofdata sources, the processing the streams of data from the plurality ofdata sources performing data enriching to provide data enriched datastreams; and, storing the data streams and the data enriched datastreams within the universal knowledge repository as a collection ofknowledge elements.
 8. The non-transitory, computer-readable storagemedium of claim 7, wherein the instructions executable by the processorfurther comprise instructions for: generating a cognitive insight basedupon the collection of knowledge elements stored within the universalknowledge repository.
 9. The non-transitory, computer-readable storagemedium of claim 7, wherein: the data enriching comprises identifying atleast some knowledge elements within the collection of knowledgeelements as at least one of facts, opinions, descriptions, and skills.10. The non-transitory, computer-readable storage medium of claim 7,wherein: the data enriching comprises identifying at least someknowledge elements within the collection of knowledge elements as atleast one of statements, assertions, beliefs, perceptions, preferences,sentiments, attitudes and opinions and associating identified at leastsome knowledge elements with an entity responsible for generating the atleast one of statements, assertions, beliefs, perceptions, preferences,sentiments, attitudes and opinions.
 11. The non-transitory,computer-readable storage medium of claim 7, wherein: the universalknowledge repository comprises a private knowledge repository; and, atleast some of the plurality of data streams are private data streams.12. The non-transitory, computer-readable storage medium of claim 7,wherein the universal knowledge repository comprises a universalcognitive graph.
 13. The non-transitory, computer-readable storagemedium of claim 7, wherein the computer executable instructions aredeployable to a client system from a server system at a remote location.14. The non-transitory, computer-readable storage medium of claim 7,wherein the computer executable instructions are provided by a serviceprovider to a user on an on-demand basis.